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Abstract 

This paper presents an improved result on the negative-binomial Monte 
Carlo technique analyzed in a previous papei0 for the estimation of an un- 
known probability p. Specifically, the confidence level associated to a relative 
interval [p//^2, ppi]) with /ii, /i2 > 1, is proved to exceed its asymptotic value 
for a broader range of intervals than that given in the referred paper, and for 
any value of p. This extends the applicability of the estimator, relaxing the 
conditions that guarantee a given confidence level. 
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1 Introduction 

Monte Carlo (MC) methods are widely used for estimating an unknown parameter 
by means of repeated trials or realizations of a random experiment. An important 
particular case is that in which the parameter to be estimated is the probability p of 
a certain event -ff , and realizations are independent. In this setting, the technique of 
negative-binomial MC (NBMC) [1] can be used. This technique applies a sequential 
stopping rule, which consists in carrying out as many realizations as necessary to 
obtain a given number N of occurrences of H . Based on this rule, an estimator is 
introduced in [T], and it is shown to have a number of interesting properties, in the 
form of respective bounds for its bias, relative precision, and confidence level for 
a relative interval [p/m2, PMi]; Mi, M2 > 1- Specifically, regarding the latter, it is 
derived in [T] that the confidence level c = Vv[p/ ^2 < P < PMi] asymptotic 
value c as p ^ 0, given bjH 

c = ^{N,{N -l)^l2)-l[N,^^^^ . (1) 

Furthermore, the confidence level c is assured to exceed c for 



N + ^/N N -I 
M2 > — — —, Ml > 1=. (2) 
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In this paper, the sufficient conditions that assure a confidence level c > c for 
the NBMC estimator are relaxed in two ways: 

• The restriction on p given by ([3]) is eliminated, i.e. p can be an unconstrained 
value between and 1. 

• The condition for fxi given by ^ is weakened, while maintaining the condition 
for fi2- Thus fj,i can be further decreased while having the same guaranteed 
confidence level given by H]). 

The result is presented in Section [21 and conclusions are given in Section [31 



2 Result 



Consider a random experiment, and an event H associated to that experiment (more 
generally, there may be a set of events associated to the experiment, one of which 
is of interest). The probability p of event H is to be estimated from independent 
realizations of the experiment, using the method described in [1]. Specifically, giveifl 
iV G N, with > 3, realizations are carried out until N occurrences of H are 
obtained. The number of realizations is thus a negative-binomial random variabl^ 
n, from which p is estimated as [T] 



P 



N - 1 



n 



(4) 



For /ii, //2 > 1 given, consider the interval [p//X2, p/ii], and its associated con- 
fidence c = Pr[p/^2 < P < PfJ-i]- As shown in [1], c tends to c given by H]) as 
p^O. 

Proposition 1. For any p e (0, 1), with p given by (|4]), the lower bound c > c 
holds if 

n + Vn n-1 

> — TT — Ml > 



N - 1 



Proof. Consider > 3, /ii, ^2 > 1, and p E (0, 1) 

^N- 1" 

ni 



Let us define 



"2 = 



PMi 

{N - l)^l2 
p 



(5) 

(6) 
(7) 



The confidence c is given by 1 — ci — C2 witlj§ 



Ci = Pr[n < ni — 1] 



{N 



-N 



(8) 



n=N 



^ [-J and [■] respectively denote rounding to the nearest integer towards — oo and towards oo. 

*N denotes the set of natural numbers, {1, 2, . . . }. 

^Random variables are denoted in boldface throughout the paper. 

'^The following notation is used; A;(') = k{k - 1) ■ ■ ■ (k - i + 1), fc(0) = 1. 
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C2 = Pr[n > ^2 + 1] 



(9) 



(TV - 1)! 

Let ci and 62 be respectively defined as linip^o Ci and linip^o C2 • From [J appendix 
C], ci = 7(7V, {N-l)/fj.i) and C2 = l-7(iV, [N - IW). We will show that ci > ci 
and C2 > C2 for /xi, ^2 as in ([5]). This will estabHsllO that c > c. 

The inequality C2 > C2 is equivalent to Pr[rj. < 77.2] > linip^o Pr[''^ ^ '^2], which 
is established in [H appendix C] for /i2 as in ((S]) and 712 given by ([71) • 

In order to show that Ci > Ci, we first note that 



JV-l 



{N-IV-Jn-i 



(10) 



Lemma [T] given in the Appendix implies that the right-hand side of pop will be 
greater than or equal to that of ([8]) if rii - 1 < (iV - 1/2 - ^/N - 1/2) /p + 1/2. 
From Til is upper bounded by (N — l)/(p/ii) + 1. Therefore, in order to assure 
that ci > ci it is sufficient that 



< — — - + i (11) 



or equivalently 



Pfii p 2 



Ml > ^ , ^ • (12) 



Since p> 0, IE]) holds for ^1 as in (O. □ 

This result removes some of the restrictions that are used in to assure that 
c > c. Specifically, p can take any value, and the minimum required value for /ii is 
lower. 

For the particular case that /ii = /X2 = 1 + ni, where to > is a relative error 
margin, it is easily seen that the limiting condition in (O is that for ^2, i-e. 

m>^;^. (13) 

The dashed curves in Fig. [T] depict the guaranteed confidence c (given by ([T])) as a 
function of N and to, for m within the allowed range (|13p . The solid line represents 
the minimum confidence c„ii,i that can be guaranteed as a function of to. This 
corresponds to the lowest TV permitted by (|13p for a given to; increasing N gives 
larger guaranteed confidence levels. 

The achievable region in the (to, c) plane is that above the solid curve, in the 
following sense: for any (to, c) within this region, the confidence level associated to 
the error margin to can be assured to be greater than c, irrespective of p; this is 
accomplished by selecting N according to ([1]) (or, equivalently, using the curves in 



^It should be noted that, although [1, appendix C] considers n2 > a, the actual range of values 
for n2 is 712 > a — 1. Nonetheless, the proofs in [1, appendix C] can be readily generalized to 
722 > a — 1. 
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Figure 1: Guaranteed confidence c and minimum confidence that can be guaranteed 

Cm in 

[U fig. 5(a)]). Comparing with [TJ fig. 4], it is seen that Proposition [T] enlarges the 
achievable region, specially for low to; besides, it removes the restriction on p. Fig.[T] 
thus replaces [2 fig. 4(a)]; and [TJ figures 4(b) and 5(b)] are no longer necessary. 

As an example, consider the following problem: given an NBMC estimator p of 
an unknown p with N = 30, find the minimum to such that Pr[p/(1 + to) < p < 
p{l + m)] > 75%. From Proposition [1] and ((1]), to = 23.7%. The results in [1], on 
the contrary, can only give to — 24.5%, since (23.7%, 75%) is not in the achievable 
region according to |Jj fig. 4(a)]; besides, p < 0.224 is required. 



3 Conclusions 

In this paper, the statistical characterization of the NBMC estimator introduced in 
[T] has been improved by relaxing the conditions that guarantee a certain confidence 
level. It has been established that, for p E (0, 1) arbitrary, the NBMC estimator 
has a confidence level better than ([T]) provided that fii, ^2 satisfy ([5]). This result 
extends the range of application of the NBMC estimation technique. 

A Appendix 

The following lemma, used in the proof of proposition [1] is now established. 
Lemma 1. Given N, n* £ N with N > 3 and 



n-\-Jn^\ 1 
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the curve is convex 




N (N-^-^lN-^)/ p rf 



Figure 2: Graphical representation for the proof of Lemma [T] 

the following inequality holds: 

/■"* "* 

/ n^-ie-f"dn> V(n-l)(^-i)(l-p)"-^. (15) 

Proof. We first note that the sub-integral function is increasing for n < {N — l)/p, 
and is convex for {N — l — y/N~ < n < {N — l)/p. As figure [2] illustrates, each 
term of the sum in ([T5)) can be identified with the area of a rectangle of unit width. 
Specifically, the term corresponding to a given n is associated to the rectangle that 
extends horizontally from n — 1 to n in the figure. In addition, for the rectangles 
situated to the right of {N — 1 — y/ N — l)/p (where the sub- integral function is 
convex), the flat tops are replaced by straight lines joining the centers, without 
altering the total area. The inequality p^)) will hold if the area below the curve 
in the interval {N — 1, n*) is larger than the shaded area in the figure. We divide 
this interval in two: (A^ - 1, (A^ - 1 - VN^)/p) and ((iV - 1 - y/N^)/p, n*), 
and require that in each of these intervals the area of the curve be larger than the 
part of the shaded area corresponding to that interval. In the first interval, since 
the curve is increasing, it suffices that the curve be above the square marks for 
n < L(-^ — 1 — \/N — 1) /p\ , as shown in the figure. In the second interval, since the 
curve is increasing and convex, it suffices that the curve be above the square mark 
located at [(A^ — 1 — y/N — l)/p\ + 1 and above the triangle marks. As a result, to 
establish it is sufficient that 

(i) the sub-integral function be above the square marks for iV < n < {N — 1 — 
y/irn:)/p+l; and 

(ii) the sub-integral function be above the triangle marks for {N—l — ^N — 1) /p+ 
1 < n < n* . 

We analyze these conditions separately. 
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(i) With X defined as 



1 ^^^_^^N-l^-{n-l)p 

X — — in 



p ^-^ \ n — 1 



n- N 
P 



(16) 



ln(l-p)-(n-l), 



condition (i) is expressed as x > for n < {N — 1 — \/N — + 1. Defining 
M = N -I and v ^ {n - l)p, 



^ i=l 



W-1 







)-; 









Using the Taylor expansion ln(l ~ t) — — 1^1 ^ 1' fT7|) is transformed 

into X = J^'jLo •'^jP' with 



N-l 



M 



J + 2 J + 1 



(18) 



The term xq is easily seen to be nonnegative for v < M — \/ M, i.e. for n < {N — 
l-y/N^)/p+l. We now prove that the remaining coefficients Xj , j > 1 are also 



nonnegative for v < M — \JM. 

We begin with the case N > 5, j > 2. Using the inequality 



JV-l 



^ (i - 1)^ + ^ > / (l - l)^ + ldi 

i=l 

_ (A^ - 2)^+2 _ (M - 1)^+2 



(19) 



J +2 



J +2 



in pB]). we can bound 



(Af - 1)^+2 + (j + l)z/J+2 - (j + 2)Mi.J'+i 



(20) 



(j + l)(j- + 2)i/^+i 

The denominator in is positive. Let yj denote the numerator. We compute 



^ = {j + 2){j + !){,.- My <0, 



(21) 



from which it suffices to consider v = M — y'M. Expressing 

i+i 



J+2 



(22) 



+ (j + 1) 1 



(j + 2) 
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and denoting the bracketed term by Yj , we now compute the following partial deriva- 
tives as if j were a continuous variable: 



1 

In ( 1 



1 \ 1 



J+2 



(23) 



\/I7/ ^fM 

The right-hand side of (|24|) is positive, and using the inequality ln(l -I- i) > t — t2/2 



^'^^ = fl--^) (1 + ^) + (24) 



we can bound (|23|) for j = 2 as 

5i ,=2 



M) \ ^JMJ VM ^25) 
2Vm J VM 



Therefore the right-hand side of ([23|) is positive for j > 2. Consequently, in order to 
establish that yj > 0, it suffices to show that Y2 > 0. Defining m = Y2 can 

be expressed as —m^im? + im? -\- 2m — 2). According to Descartes' sign rule, this 
polynomial has only one positive root. For m — > cx) the polynomial takes negative 
values, and for m = 1/2 it is positive. Therefore, it is positive for m < 1/2, i.e. for 
M > 4, or iV > 5. 

For N = 4, j > 2, we have 



_ (j + 2)(1 + 2^+1) + U + - -Sjj + 2)iy^+^ 

(j + 2)(j + l)i^^+i 



(26) 



Let Zj denote the numerator in (|26p . Since 



^ = {j + 2){j + iy{i.^3)<0, (27) 

it suffices to consider ;/ = 3 — V3- Bounding zj as 

z, > {j + 2)(2^"+i - + U + (28) 

^iL=3-\/3 seen to be positive for j > 2. 
For = 3, j > 2, 



_ j + 2 + {j + - 2{j + 2y+^ 

(j + 2)(j- + 1)^.^+1 

and similar arguments to those for = 4, j > 2 show that Xj > 0. 
For A^ > 3, j = 1, using the identity 

n2 (^V-2)3 nV-2)2 A^-2 
(M-l)3 (M-l)2 M~l 



(29) 



(30) 
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we obtain from 



4i/3 - 6Mi/2 + 2(Af - 1)3 + 3(M - 1)^ + Af - 1 

= j^^;^ ■ (31) 

From Descartes' sign rule, the numerator of (j3ip considered as a polynomial in 
has two positive roots at most. This polynomial is positive for v ^ and for 
1/ — > oo, and negative for v = M. Thus, it will be positive for < M — \fM if it 
is for V = M — \fM. Substituting this v and defining m = \fM^ the numerator is 
expressed as m^(3m^ — 4m + 1), which is positive for m > 1, or equivalently M > 1, 
and thus for > 3. 

(ii) With x' defined as 

a: = - In ^ ^' 



N 



y In 1 - ^ (32) 
_!i_iXln(l-p)-(^„-i 

and taking into account HH), in order to fulfil condition (ii) it is sufficient that 
x' > for n < (TV - 1/2 - y^N - l/2)/p + 1/2. Defining M' = N - 1/2 and 
i^' = {n- l/2)p, 



(33) 




P \P 

Proceeding as with a;, we can express x' — X^j^o ^'jP^ with 



-Eh-o +T-,-—-y (34) 



(j + l)z/'J+i ^ V 2/ J + 2 j + 1 



Xq is seen to be nonnegative for v' < M' — \J M' — 1/4, and thus for v' < M' — \/ M', 
i.e. for n < {N — 1/2 — \/ N — \/2)/p + 1/2. We now prove that the remaining 
coefficients x'p j >1 are also nonnegative for v' < M' — V M'. 
We begin with the case N > 5, j > 2. Using the inequality 



N-l y / j^XJ + l 



(35) 



j + 2 J +2 

we can bound 

, {M' iy+' + (j + iy^+^ - u + 2)Af 

(, + l)(j + 2)^0+1 • W 
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The denominator in ((36)) is positive, and the numerator is as that in ((20l) with M 
replaced by M' and v replaced by v' . It stems that x'^ > for M' > 4, i.e. > 5, 
and j > 2. 

For N ^ A, j >2, ^ gives 



(37) 



ij + 2) {ihY^" + ilY^" + ilY^") + U + 1)-'^+' 



which is shown to be positive with analogous arguments as for Xj. 
For = 3, j > 2, 



-|(j + 2)z.'^+i]/[(j + 2)(j + l)j.'^+i], 



and similarly it is shown to be positive. 
For > 3, j = 1, using the identity 



N-l 

E 

i=l 



N 



(iV- 



(M' - 1)^ (M' - ly 



6 
M' 
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(38) 



(39) 



we obtain an expression for x'^ which coincides with (PT|) replacing M by M' and i' 
by v' . Therefore, x'j is positive for M' > 1, and thus for > 3. 

According to the foregoing, conditions (i) and (ii) hold for A^ > 3. This estab- 
lishes the stated result (ITSI) . □ 
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